skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Search for: All records

Creators/Authors contains: "Zhou, Yuansheng"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Abstract The t-distributed stochastic neighbor embedding (t-SNE) method is one of the leading techniques for data visualization and clustering. This method finds lower-dimensional embedding of data points while minimizing distortions in distances between neighboring data points. By construction, t-SNE discards information about large-scale structure of the data. We show that adding a global cost function to the t-SNE cost function makes it possible to cluster the data while preserving global intercluster data structure. We test the new global t-SNE (g-SNE) method on one synthetic and two real data sets on flower shapes and human brain cells. We find that significant and meaningful global structure exists in both the plant and human brain data sets. In all cases, g-SNE outperforms t-SNE and UMAP in preserving the global structure. Topological analysis of the clustering result makes it possible to find an appropriate trade-off of data distribution across scales. We find differences in how data are distributed across scales between the two subjects that were part of the human brain data set. Thus, by striving to produce both accurate clustering and positioning between clusters, the g-SNE method can identify new aspects of data organization across scales. 
    more » « less
  2. Rueppell, Olav (Ed.)
    Animals use odors in many natural contexts, for example, for finding mates or food, or signaling danger. Most analyses of natural odors search for either the most meaningful components of a natural odor mixture, or they use linear metrics to analyze the mixture compositions. However, we have recently shown that the physical space for complex mixtures is ‘hyperbolic’, meaning that there are certain combinations of variables that have a disproportionately large impact on perception and that these variables have specific interpretations in terms of metabolic processes taking place inside the flower and fruit that produce the odors. Here we show that the statistics of odorants and odorant mixtures produced by inflorescences ( Brassica rapa ) are also better described with a hyperbolic rather than a linear metric, and that combinations of odorants in the hyperbolic space are better predictors of the nectar and pollen resources sought by bee pollinators than the standard Euclidian combinations. We also show that honey bee and bumble bee antennae can detect most components of the B . rapa odor space that we tested, and the strength of responses correlates with positions of odorants in the hyperbolic space. In sum, a hyperbolic representation can be used to guide investigation of how information is represented at different levels of processing in the CNS. 
    more » « less